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published in: Biosystems. 2008 Dec;94(3):248-52 



M. Wojciechowski and Marek Cieplak 
Institute of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland 

We perform molecular dynamics simulations for a simple coarse-grained model of crambin placed 
inside of a softly repulsive sphere of radius R. The confinement makes folding at the optimal 
temperature slower and affects the folding scenarios, but both effects are not dramatic. The influence 
of crowding on folding are studied by placing several identical proteins within the sphere, denaturing 
them, and then by monitoring refolding. If the interactions between the proteins are dominated 
by the excluded volume effects, the net folding times are essentially like for a single protein. An 
introduction of inter-proteinic attractive contacts hinders folding when the strength of the attraction 
exceeds about a half of the value of the strength of the single protein contacts. The bigger the 
strength of the attraction, the more likely is the occurrence of aggregation and misfolding. 



I. INTRODUCTION 



There is a growing interest in studies of biomolecules enclosed within a limited space. One reason is that almost all 
life processes t ake place in compartme nts such as cells where concentrations of proteins, lipids, shugars, and nucleic 
acids are large (jEllis and Mintonl . 120061 ). Such condi tions are also desired in artificial life syste ms such as the liposomes 
that allow for protein synthesis within their interior (jMurtas et al. 1 . 120071) . Chaperonin cages (jHartl and Hayer-Hartl . 
2002| ). that assist in folding and refolding processes of proteins, offer an example of compartmentalization at a still 
smaller length scale. Another reason for the interest in the confinement effects is provided by recent advances in 
nanotechnology and resulting novel encapsulation techniques. These involve, for instance, reverse micelles which are 
mimetic systems of biological membranes composed of amphiphilic molecules. Thes e molecules self-orga-nize so that the 



polar head-groups point inward and hydrocarbon chains face the organic solvent (jLuisi et al 



1992 



Melo et al. 



1988 



Matzke et al. 



20031 ). The amount of the entrapped water is controlled by experimental conditions and a typical 



radius of the correspo nding sphere can be as small as ~20 A. The water molecules at the inner surface have a 



propensity to organize (jMoilanen et al. 1 . 120071 ) and the conditions within need not be uniform (jBaruah et al.l ■ l2006^ . 



When it comes to larger confined systems, there are many microfluid ic ways to deposit droplets on surfaces, e.g. in 
the context of the protein and DNA microarrays (jPuroux et al.l . l2007l ). 

It is thus interesting to unde rtake theoretical studies of proteins that are confined. A simple way to in troduce 



confinement is through a sphere (jBaumketner et al.l . 12003 ; 



Rathore et al 



20061) ■ or a cage (|Takagi et al.l . l2003[) . which 



are repulsive to proteins located on the inside. A sphere which has attractive hydrophobic and repulsive hydrophilic 
patches on the inside has been also discussed (jjewett et al.l . 120041 ) to elucidate the workings of chaperonins. One 
can also generate ca vities by using many spheres, repulsive on the outside, to immitate the effects of crowding 
(jCheung et al.l . l2005f ). Most of the studies carried out so far have been focused on thermodynamics. The confinement 
has been found to lead to a greater thermodynamic stabifity, br oader and taller specific heat and more compact 



even further (jCheung et al 



unfolded con formations ([Rathore et al 

I2OO5I ). 



2006 



Takagi et al.l . 120031 ). Crowding is expected to enhance these effects 



In t his paper, we consider the kinetics of folding o f a protein. This pro blem has already been studied by, Baumketner 
et al. ( Baumketner et all 2003 ) and Jewett et al. ( Jewett et al. . 2004). In the case of the confining repulsive sphere 



(Baumketner et al 



20031 ). the wall potential was represented by 
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where Rs is the radius of the sphere, r is the distance of a C" atom from the center of the sphere, e^jaii is the strength 
of the potential, and a is take to be equal to 3.8A, i.e. to the distance between two consecutive C" atoms in a protein. 
The folding time, determined at various temperatures, has been found to depend on R^ in a complicated manner. 
For instance, at temperatures below the optimal temperature it decreases with in creasing the Rg, but it increases 
above this temperature. In the case of the non-uniform sphere (jjewett et al.L 120041 ) the physics involved depends on 
the strength of attraction to the hydrophobic patches in the model. If the attractive patches act as strongly as the 
hydrophobic interactions in the protein, the protein sticks to the wall and folding is arrested, i.e. it takes forever. A 
reduction in the strength of the attraction leads to a lowering of the folding time until a minimum is reached and 
then the folding time increases to a finite value when the wall becomes purely repulsive. 

Here, we focus on the effects of confinement and crowding within a softly repulsive sphere. We investigate the 
kinetics of folding of a single protein as a function of the radius of the sphere and then also as a function of the 
interactions between atoms belonging to different proteins when several proteins are confined together. In order to 
house the proteins, we consider a wall represented by the truncated and shifted Lennard- Jones potential 
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where R will be referred to as the radius of the sphere, and a — ■ 2^i. We take ro— 4 A (which is equal to the size 
of the repulsive core in the non- native contacts as defined below) . The specific form of a purely repulsive potential 
representing the sphere should not matter much for th e kinetics of folding . How ever, what may matter more is the 
choice of a model of the protein. Bau mketner et al. (Baumketner et al.L 120031 ) use a 27-bcad minimalistic model 
proposed by Honeycutt and Thirumalai (jHonevcutt and Thirumalail . 119921 ) in which pair- wise interactions depend on 
whether the amino acids involved are hydrophobic or polar. 

We use an other simple coarse-grained model, a Go-like model (IGo and Abd. Il981r). in an implementation devel 
oped in refs. (ICieplak and Hoane 



Kwiecinska and Cieplakl . 



2005; 



2003; 



Cieplak et al 



Szvmczak and Cieplak . 



2004; 



Cieplak and Sulkowskal . l2005t iHoang and Cieplak . 



200C; 



20061 ) and perform molecular dynamics studies of folding and 



unfolding. The Go-like models are rather imperfect tools to use in the context of folding, but are often found to be 
adequate to settle various qualitative issues, especially of a comparatory nature. Their advantage is that they allow 
for a thorough statistical analysis of time dependent processes involving large conformational changes. 

As an illustration of this approach, we consider crambin. This is an a — /? protein comprising of 46 amino acids. In 
its native state, the radius of gyration is about 9.7 A and the largest distance between a pair of its C" atoms is 30 A. 
The minimum value of R that does not violate the steric constraints and still allows for meaningful conformational 
transition is 18 A and the corresponding plot of the Vwaii potential is shown in Figure (TJ together with a schematic 
representation of the native conformation. 

We find that confinement under optimal folding conditions makes folding last longer but not more than by a factor 
of 2. We then consider the effects of crowding by placing up to twelve identical proteins inside of the sphere and 
studying refolding of the thermally denatured conformations. If a protein acts on another protein only through their 
excluded volumes, then the folding process, at optimality, is almost the same as for the single protein case. If one 
introduces attractive contacts between the proteins then, above their certain strength, the folding is hindered more 
substantially, even under the optimal conditions, since the conformational collapse now competes with aggregation. 



II. METHODS 



The details of the approach are explained in refs. (jSulkowska and Cieplak 1 . 120071 120081 ). Briefly, each amino acid is 



represented by a bead located at the C" position. The beads are tet hered into a chain by the harmo nic interactions. 
The local backbone stiffness is represented by a chirality potential (|Kwiecinska and Cieplakl . 120051 ) that favors the 
native sense of the chirality, i.e. the native values of the dihedral angles. The interactions between the amino acids are 
divided into the repulsive non-native contacts and attractive native contacts. The division is based on the absence or 
presence of the atomic overlaps in the experimentally determined native conformation. The attractive native contacts 
between amino acids i and j in distance Vij are described by the Lennard Jones potential V^"' = 4£((^)^^ — (^)^), 
where the length parameter dy is determined so that the minimum in the potential agrees with the distance determined 
experimentally. The energy parameter e should be of order 1 - 1.6 kcal/mol. We set Swaii to be equal to e. The model 
incorporates implicit solvent effects through temperature controlled random forces and strong velocity dependent 
damping. Room temperature situations are considered to arise in the vicinity of fc^r/e of order of 0.3 (fc^ is the 
Boltzmann constant and T is the temperature). The time scale, r, in the molecular simulations is of order of 1 ns 
as it is set by time needed to cover distances of order of a typical through diffusion and not through a ballistic 
motion. 

When studying folding, we determine the median time, tfoid, needed to establish all native contacts for the first 
time when starting from an unfolded conformation. A native contact is assumed to be established when < 1.5 tiy . 
tfoid is determined based on at least 301 trajectories. A convenient way to represent the sequencing of the folding 
events is through the scenario diagrams in which one shows average times when a specific contact gets established for 
the first time. The contacts are labelled by their sequential distance \i — j\. 

Simulations of folding require a prior generation of extended denatured structures to fold from. These are obtained 
by unfolding the native structure by applying a high temperature. Time scales required to arrive at a state in which 
all native contacts are broken are too long to achieve in the simulations. Instead , we ta ke the criterion of all contacts 
with \i — j| > 5 being broken (see a discussion in rcf. (jSulkowska and Cieplak 1 . 12007^ ). The corresponding median 
unfolding time is denoted by t^nf ■ 

Confinement puts restrictions on the possible starting conformations since they must fit the sphere. Thus we 
generate the starting sets by first placing a protein (or proteins) in a native state in a sphere of radius i?o i then setting 
/csT/e at 1.0, and finally by storing conformations obtained at the end of a lOOOOr-long process of the unfolding 
dynamics (the corresponding t^nf ranged between 100 and 2000 r depending on Rq). One can simulate refolding 
either for R — Rq, which is the simplest situation, but one may also consider refolding of the i?o-generated structures 
in a larger space when R > Rq and, in particular, for R — oo. 

The resulting unfolded structures are governed by the value of Rq as shown in Figure [2] which provides structure 
characterization through the average radius of gyration, < Rg >, and the average fraction of the native contacts that 
are still present, Q. Notice that even for very large values of Rq there is always a small fraction of unbroken native 
contacts. These are usually associated with the a helices. 

In the case of ti > 1 molecules {n up to 12 was considered), we place them together at the center of the sphere 
and then move them away from one another in small steps along arbitrary directions until they stop overlapping (see 
Figure [3]). The system is then unfolded thermally for 10000 r and the resulting structures (see Figure 2]) are used 
for refolding studies, tfoid for n molecules was determined by calculating the folding time of the individual molecules 
and then by taking the median over the molecules and over various trajectories. Each trajectory was stopped either 
when a time cutoff was exceeded or when each molecule was declared to arrive at the native conformation at some 
point during the evolution. Another possible criterion would involve requiring a simultaneous establishment of all 
native contacts in the system. This would yield folding times that are significantly longer and comparing systems 
with different values of n would not be relevant in the context of crowding since the behavior of a single molecule is 



what is of the interest here. The scenarios of folding were reahzed hke in the case of one molecule - just the averaging 
gets enhanced by the factor of n. 



III. FOLDING OF A SINGLE PROTEIN 

Figure [5] shows tfoid as a function of T for the three choices of Rq: 18, 30, and 50 A. The left panel corresponds 
to refolding in an unrestricted space whereas the right panel to refolding in the sphere with the original radius of 
confinement. In the unrestricted space, tfoid is nearly independent of Rq. On the other hand, the presence of the 
sphere makes a change. There is not much of a difference in folding between Rq = R = 30 K and 50 A, but there is 
a noticeable increase in tfoid as one considers smaller values of R: tjoid increases from 150 to 200 r when R—18 A is 
considered. The differences become more visible when comparing folding from starting conformations corresponding 
to a given Rq and then evolved in the sphere or in the unrestricted space. For i?o=18 A, the change is from 137 r 
to 200 T, i.e. by a factor of 1.5. We interpret this finding as originating from the wall exerting a restriction on the 
process of the collapse: the protein may bounce with the wall on the way to its globular form and may need time to 
find another path. 

Figure [H] shows that the folding scenarios show more substantial sensitivity to the value of Rq of the starting 
conformations than the folding times do (both for the unrestricted refolding, shown in the left panel, and for R — Rq). 
The observation is that the tighter the initial confinement, the faster the establishment of the individual native 
contacts. The complete folding, however, requires a simultaneous establishment of all contacts and this circumstance 
is sensitive to confinement to a much lesser degree. 



IV. FOLDING OF A SEVERAL PROTEINS 



We now discuss the effects of crowding and we place n crambin molecules together into a sphere of radius i?=36 A. 
n varies between 1 and 12. We monitor the folding process when R=Rq. We first consider the simplest case in which 
the only way one molecule knows about the other is through the finite size of the hard cores (vq = 4 A) associated 
with the beads. Fig [7] shows that under the conditions of optimal folding, tfoid essentially does not depend on n, i.e. 
when folding goes well, processes in one molecule do not affect movements of the other. 

We now repeat these calculations after introducing attractive interactions between the molecules. Since these 
molecules are all identical, the simplest way to do it is by introducing inter-protein attractive contact interactions in 
the following way. If there is a contact between i and j in one molecule A then we generate a similar contact between 
i\ in molecule A and in molecule k. Each intermolecular contact is assigned an amplitude of ej in the corresponding 
Lennard- Jones potential. We vary ej/e between and 1. The value of (Ty is kept the same as for the single protein. 
The corresponding potential is given by 
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where the prime in j' indicates a different molecule. 

The resulting folding times are shown in Figure [51 It is seen that tfoid is not affected by the inter-protein attraction 
as long as e//e does not exceed a treshold value and then essentially all trajectories fold. An example of a folded 
conformation is shown in Figure [5] For larger values of e/, the molecules influence one other which makes tfoid longer 
and increases the frequency of the misfolding events, as shown in the inset of Figure [51 The threshold value decreases 
with n. For n=4 it is ~ 0.5 and for n=12 it is 0.3. 



The misfolding is primarily due to aggregation and entanglement. An example of a misfolded conformation is shown 
in Figure [TUl In most trajectories, the secondary structures (like helices 7-19 and 23-30) in individual molecules are 
established the first. At this stage, misfolding may occur since the intermolecular interactions tend to bring the 
secondary structures from various molecules together, generating a steric hindrance to further folding. 

It should be noted that the presence of a substantial ej also affects the folding scenario as seen in Figure [TT] for 
ej/e—O and 0.7 - the case for which tfoid varies by a factor of about 6. It is seen that the inter-protein interactions delay 
establishment of the contacts as they compete with the internal interactions. However, if the contact establishment 
times are normalized to t fold to bring out the relative changes in the structure of the scenario diagram then such 
relative times get shorter. 



V. CONCLUSIONS 

Our studies within the simple model employed here suggest that both confinement and crowding can affect the 
folding process of a protein. The confinement related effects on the folding time are weak unless there are attractive 
interactions between the proteins. It is also expected that increasing the number of proteins in a given sphere (n 
bigger than 4) will eventually affect folding significantly. This point remains to be studied further. The crowding 
effects may get enhanced further when one a ccounts for the hydrodynami c interactions in the system. One way to 



include these interactions is discussed in ref. ljSzvmczak and Cieplakl . 120071 ). It would be worthwhile to check other 



model proteins to check for any universalities in the behavior. 

This paper is dedicated to Prof. Zbigniew Grzywna on his 60'th birthday and was motivated by a discussion with 
Nancy E. Levinger. The work involved has been supported by the grant N N202 0852 33 from the Ministry of Science 
and Higher Education in Poland. 
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FIG. 1: The potential of the spherical wall with f?=18 A used in this paper, r denotes the distance of a bead form the center 
of the sphere. 




FIG. 2: Geometrical characteristics of the unfolded structures of a single protein, as obtained for fcsT/e=1.0, as a function of 
the radius of the sphere in which they were generated. 



FIG. 3: Four model molecules of crambin in their native state placed in the sphere. 



FIG. 4: An example of an unfolded conformation of four proteins. 




FIG. 5: The median folding times for the model crambin as described in the main text. The left panel is for folding in an 
inifinitely large sphere. The values of Ro characterize the nature of the starting unfolded conformations. The right panel is for 
spheres of three sizes as indicated. 
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FIG. 6: The folding scenarios in the unrestricted space (the left panel) and confined space (the right panel) for Ro=18 A (the 
solid symbols) and 50 A (the open symbols). The temperature is set to ksT/s = 0.3 which corresponds to the temperature 
of optimal folding. The symbols ai + a2 and /3i + (^2 indicate contacts between the two a-helices and two /3-strands that are 
present in crambin. 




FIG. 7: Folding time for an ensemble of n proteins in the spherical cavity of R- 
between the proteins. The values of n range from 1 to 12 as indicated. 



;=36 A when there are no attractive interactions 




FIG. 8: Median folding time for n proteins at fcsT'/e=0.3 as a function of the inter-protein interaction strength sj. The inset 
shows the fraction of misfolded trajectories. 



FIG. 9: An example of a properly folded ensemble of four proteins for eije = 0.5. 



FIG. 10: An example ol a misfolded and entangled conformation of four proteins corresponding to sije = 0.8. 
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FIG. 11: The folding scenario for four proteins at kBT/e=0.3 for ej/e=0 (crosses) and 0.7 (open squares). 



